Logic Programs for Repairing Inconsistent Dimensions in Data Warehouses

نویسندگان

  • Loreto Bravo
  • Mónica Caniupán Marileo
  • Carlos A. Hurtado
چکیده

A Data Warehouse (DW) is a data repository that integrates data from multiple sources and organizes the data according to a set of data structures called dimensions. Each dimension provides a perspective upon which the data can be viewed. In order to support an efficient processing of queries, a dimension is usually required to satisfy different classes of integrity constraints. In this paper, we study the problem of repairing a dimension when it fails to satisfy a set of two classes of integrity constraints: strictness constraints and covering constraints. We introduce the notion of minimal repair of a dimension in this context. A minimal repair is defined as a new dimension that is consistent with respect to the integrity constraints, which is obtained by applying a minimal amount of updates to the original dimension. We study the complexity of computing minimal repairs. Finally, we show how to characterize and compute minimal repairs of a dimension using Datalog programs with stable model semantics.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Repairing inconsistent dimensions in data warehouses

A dimension in a Data Warehouse (DW) is a set of elements connected by a hierarchical relationship. The elements are used to view summaries of data at different levels of abstraction. In order to support an efficient processing of such summaries, a dimension is usually required to satisfy different classes of integrity constraints. In scenarios where the constraints properly capture the semanti...

متن کامل

Efficient Algorithms for Repairing Inconsistent Dimensions in Data Warehouses

Dimensions in Data Warehouses (DWs) are usually modeled as a hierarchical set of categories called the dimension schema. To guarantee summarizability, this is, the capability of using pre-computed answers at lower levels to compute answers at higher levels, a dimension is required to be strict and covering, meaning that every element of the dimension must be connected to a unique ancestor in ea...

متن کامل

Autoepistemic Belief-revision for Integration of Mutually Inconsistent Knowledge

It is well known that the standard 3-valued logic programs with constraints can be inconsistent. Because of that we can not use it for a data integration where mutually inconsistent information comes from different data sources. We argue that a natural way to answer to this challenge, without collapsing all sentences into inconsistency, is by passing to 4-valued bilattice-based logic (with logi...

متن کامل

A Fast and Self-Repairing Genetic Programming Designer for Logic Circuits

Usually, important parameters in the design and implementation of combinational logic circuits are the number of gates, transistors, and the levels used in the design of the circuit. In this regard, various evolutionary paradigms with different competency have recently been introduced. However, while being advantageous, evolutionary paradigms also have some limitations including: a) lack of con...

متن کامل

Optimization Methods for Logic-based Query Answering from Inconsistent Data Integration Systems

Information integration systems providing the user with transparent access to heterogeneous data sources through a unified global view of all data, have emerged as a crucial issue in many application domains. This global view usually comprises integrity constraints which should be satisfied by the data retrieved from the sources. However, they might be often violated, and suitable approaches fo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010